Gpu Acceleration of the Long-wave Rapid Radiative Transfer Model in Wrf Using Cuda Fortran
نویسنده
چکیده
This paper presents the approach and results of porting the Long-Wave Rapid Radiative Transfer Model (RRTM) component of the Weather Research and Forecast (WRF) code to the GPU using CUDA Fortran. After a brief description of the RTTM code, considerations regarding porting the application to the GPU are discussed. Included in the porting discussion are how the data structures have been modified for the GPU architecture, strategies for optimizing data movement, and determining how to partition the code into different kernels and how these kernels are configured. Results for both the original CPU and GPU code are provided in terms of accuracy and speed. Future optimizations using features not currently available in CUDA Fortran will be briefly discussed.
منابع مشابه
GPU-Based Parallel Researches on RRTM Module of GRAPES Numerical Prediction System
GRAPES (Global and Regional Assimilation and Prediction System) is a new generation of numerical weather prediction (NWP) system of China. As the system processes amount of data and requires high real-time,so it is always a hot research field of parallel computing.This is the first time that we use GPU (Graphics Processor Unit) general-purpose computing and CUDA technology on RRTM (Rapid Radiat...
متن کاملGPU Based Acceleration of WRF Model: A Review
The Weather Research and Forecasting model (WRF) is a simulating system developed for atmospheric weather prediction. WRF model is used for both operational as well as research purposes. The need for accurate weather and climate simulation to be carried out in shorter time is increasing day by day, which leads to the acceleration of existing Numerical Weather Prediction (NWP) system. This paper...
متن کاملAccelerating high-order WENO schemes using two heterogeneous GPUs
A double-GPU code is developed to accelerate WENO schemes. The test problem is a compressible viscous flow. The convective terms are discretized using third- to ninth-order WENO schemes and the viscous terms are discretized by the standard fourth-order central scheme. The code written in CUDA programming language is developed by modifying a single-GPU code. The OpenMP library is used for parall...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملAn approach to Improve Particle Swarm Optimization Algorithm Using CUDA
The time consumption in solving computationally heavy problems has always been a concern for computer programmers. Due to simplicity of its implementation, the PSO (Particle Swarm Optimization) is a suitable meta-heuristic algorithm for solving computationally heavy problems. However, despite the simplicity, the algorithm is inefficient for solving real computationally heavy problems but the pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010